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Abstract 

We compare the probability distribution of returns for the three major stock-market 
indices (Nasdaq, S&P500, and Dow-Jones) with an analytical formula recently de- 
rived by Dragulescu and Yakovenko for the Heston model with stochastic variance. 
For the period of 1982-1999, we find a very good agreement between the theory 
and the data for a wide range of time lags from 1 to 250 days. On the other hand, 
deviations start to appear when the data for 2000-2002 are included. We interpret 
this as a statistical evidence of the major change in the market from a positive 
growth rate in 1980s and 1990s to a negative rate in 2000s. 
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1 Introduction 

Models of multiplicative Brownian motion with stochastic volatility have 
been a subject of extensive studies in finance, particularly in relation with 
option pricing [1]. One of the popular models is the so-called Heston model 
[2], for which many exact mathematical results can be obtained. Recently, 
Dragulescu and Yakovenko (DY) [3] derived a closed analytical formula for the 
probability distribution function (PDF) of log-returns in the Heston model. 
They found an excellent agreement between the formula and the empirical 
data for the Dow- Jones index for the period of 1982-2001. (Discussion of 
other work on returns distribution and references can be found in Ref. [3].) 

In the present paper, we extend the comparison by including the data for 
Nasdaq and S&P500. We find that the DY formula agrees very well with the 
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data for the period of 1982-1999. However, when the data for 2000-2002 are 
included, systematic deviations are observed, which reflect a switch of the 
market from upward to downward trend around 2000. 



2 Probability distribution of log-returns in the Heston model 

In this Section, we briefly summarize the results of the DY paper [3]. Let us 
consider a stock, whose price St, as a function of time t, obeys the stochastic 
differential equation of multiplicative Brownian motion: 

dS t = nS t dt + a t S t dW t {1) . (1) 

Here the subscript t indicates time dependence, \i is the drift parameter, 
is a standard random Wiener process, and a t is the time-dependent volatility. 
Changing the variable in (1) from price S t to log-return r t = \n(S t /S ) and 
eliminating the drift by introducing x t = r t — pt, we find: 

dx t = -^dt + y^ t dW t {1) . (2) 

where Vt = is the variance. 

Let us assume that the variance v t obeys the following mean-reverting 
stochastic differential equation: 

dv t = -i(v t -e)dt + K^r t dw t {2) . (3) 

Here 9 is the long-time mean of v, 7 is the rate of relaxation to this mean, 
W^ 1 is a standard Wiener process, and k is the variance noise. In general, the 
Wiener process in (3) may be correlated with the Wiener process in (1): 

dW t {2) = pdW t {1) + ^l-p*dZ t , (4) 

where Z t is a Wiener process independent of W^, and p G [—1,1] is the 
correlation coefficient. 

The coupled stochastic processes (2) and (3) constitute the Heston model 
[2]. In a standard manner [4], the Fokker-Planck equation can be derived for 
the transition probability P t (x,v \ vi) to have log-return x and variance v at 
time t given the initial log-return x = and variance Vi at t — 0: 

l^i^Pi + iJV) (5) 
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A general analytical solution of Eq. (5) for P t (x, v \ vi) was obtained in Ref. [3]. 
Then Pt(x, v \vi) was integrated over the final variance v and averaged over 
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the stationary distribution H*(vi) of the initial variance vf. 

oo oo 

Pt(x) = J dvi J dvP t (x,v I vi) U*{yi). 



(6) 



The function Pt(x) in Eq. (6) is the PDF of log-returns x after the time lag 
t. It can be direct compared with financial data. It was found in Ref. [3] that 
data fits are not very sensitive to the parameter p, so below we consider only 
the case p = for simplicity. 

The final expression for P t (x) at p = (the DY formula [3]) has the form 
of a Fourier integral: 
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In the long-time limit t ^> 2, Eqs. (7) and (8) exhibit scaling behavior, i.e. 
Pt{x) becomes a function of a single combination z of the two variable x and 
t (up to the trivial normalization factor N t and unimportant factor e~ x l 2 ): 



P t (x) = N t e- x/2 P*(z), P*(z) = K 1 (z)/z, z = ^Jx 2 + i 2 , 
i=at/2 = te/xl, N t = iJ/nxo, 

where K\{z) is the first-order modified Bessel function. 



(10) 
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3 Comparison between the DY theory and the data 

We analyzed the data for the three major stock-market indices: Dow- Jones, 
S&P500, and Nasdaq. From the Yahoo Web site [5], we downloaded the daily 
closing values of Dow- Jones and S&P500 from 4 January 1982 to 22 October 
2002 and all available data for Nasdaq from 11 October 1984 to 22 October 
2002. The downloaded time series {SV} are shown in the left panel of Fig. 1. 
It is clear that during 1980s and 1990s all three indices had positive exponen- 
tial growth rates, followed by negative rates in 2000s. For comparison, in the 
right panel of Fig. 1, we show the time series from 1930 to 2002. Contrary to 
the mutual-funds propaganda, stock market does not always increase. During 
1930s (Great Depression) and 1960s-1970s (Stagnation), the average growth 
rate was zero or negative. One may notice that such fundamental changes of 
the market trend occur on a very long time scale of the order of 15-20 years. 

Using the procedure described in Ref. [3], we extract the PDFs p^ data ^( r ) 
of log-returns r for different time lags t from the time series {S T } for all 
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Stock indices data, 01/04/1 982-1 0/22/2002 



Stock indices data, 01/02/1930-10/22/2002 
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Fig. 1. Historical evolution of the three major stock-market indices, shown in the 
log-linear scale. The Nasdaq curve is shifted up by the factor of 1.5 for clarity. The 
vertical line separates the regions with the average positive and negative growth rates. 

three indices. In the DY theory [3], the actual (empirically observed) growth 
rate ft is related to the bare parameter \i by the following relation: ft — \i — 
6/2, and Pj; da ' ta '\x) is obtained by replacing the argument r — ► x + ftt. The 
parameters ft were found by fitting the time series in the left panel of Fig. 1 
to straight lines. With the constraint /i = ft + 6/2, the other parameters of the 
Heston model (7, 6, k) were obtained by minimizing the mean-square deviation 
J2 x ,t I lnP/ data ^(o;) — lnP t (x)\ 2 between the empirical data and the DY formula 
(7) and (8), with the sum taken over all available x and the time lags t — 1, 5, 
20, 40, and 250 days. This procedure was applied to the data from 1982 (1984 
for Nasdaq) to 31 December 1999, and the values of the obtained parameters 
are shown in Table 1. The model parameters for Dow- Jones and S&P500 
are similar, whereas some parameters for Nasdaq are significantly different. 
Namely, the variance relaxation time I/7 is much shorter, the variance noise 
k is much bigger, and the parameter a is much smaller for Nasdaq. All of 
this is consistent with the general notion that Nasdaq is more volatile than 
Dow- Jones and S&P500. On the other hand, the average growth rates ft of all 
three indices are about the same, so the greater risk in Nasdaq does not result 
in a higher average return. 

Table 1 

Parameters of the Heston model obtained from the fits of the Nasdaq, S&P500, and 
Dow-Jones data from 1982 to 1999 using p = for the correlation coefficient. 
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Nasdaq data, 10/11/1 984-1 2/31/1 999 



Nasdaq data, 10/11/1 984-1 2/31/1 999 
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Fig. 2. Comparison between the 1984-1999 Nasdaq data (points) and the 
Dragulescu-Yakovenko theory [3] (curves). Left panel: PDFs Pt(x) of log-returns x 
for different time lags t shifted up by the factor of 10 each for clarity. Right panel: 
Renormalized PDF P t (x)e x l 2 /N t plotted as a function of the scaling argument z given 
in Eq. (10). The solid line is the scaling function P*(z) = K\{z)j z from Eq. (10), where 
K\ is the first-order modified Bessel function. 

Fig. 2 compares the 1984-1999 data for Nasdaq (points) with the DY theory 
(curves). The left panel shows the PDFs Pt(x) (7) for several time lags t, and 
the right panel demonstrates the scaling behavior (10). The overall agreement 
is quite good. Particularly impressive is the scaling plot, where the points for 
different time lags collapse on a single nontrivial scaling curve spanning 10 (!) 
orders of magnitude. On the other hand, when we include the data up to 22 
October 2002, the points visibly run off the theoretical curves, as shown in 
Fig. 3. We use the same values of the parameters (/i, 7, 9, k) in Fig. 3 as in Fig. 
2, because attempts to adjust the parameters do not reduce the discrepancy 
between theory and data. The origin of the discrepancy is discussed in Sec. 4. 

Similarly to Nasdaq, the S&P500 data for 1982-1999 agree well with the 
theory, as shown in Fig. 4. However, when the data up to 2002 are added 
(Fig. 5), deviations occur, albeit not as strong as for Nasdaq. For Dow- Jones 
1982-1999 (Fig. 6), the data agrees very well with the theory. The PDFs for 
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Nasdaq data. 1 0/1 1 11 984-1 0/22/2002 
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Fig. 3. The same as in Fig. 2 for 1984-2002. 
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S&P500 data, 01/04/1982-12/31/1999 S&P500 data, 01/04/1982-12/31/1999 
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Fig. 4. The same as in Fig. 2 for S&P500 for 1982-1999. 

1982-2002, shown in the left panel of Fig. 7, still agree with the theory, but 
deviations are visible in the scaling plot in the right panel of Fig. 7. They 
come from the time lags between 40 and 150 days not shown in the left panel. 



4 Discussion and conclusions 

We conclude that, overall, the PDFs of log-returns, Pt(x), agree very well 
with the DY formula [3] for all three stock-market indices for 1982-1999. It is 
important to recognize that the single DY formula (7) and (8) fits the whole 
family of empirical PDFs for time lags t from one day to one year (equal to 
252.5 trading days). The agreement with the nontrivial Bessel scaling function 
(10) extends over the astonishing ten orders of magnitude. These facts strongly 
support the notion that fluctuations of stock market are indeed described by 
the Heston stochastic process. 

On the other hand, once the data for 2000s are included, deviations appear. 
They are the strongest for Nasdaq, intermediate for S&P500, and the smallest 
for Dow- Jones. The origin of the deviations can be recognized by looking in 
Fig. 1. Starting from 2000, Nasdaq has a very strong downward trend, yet we 
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S&P500 data, 01/04/1982-10/22/2002 
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Fig. 5. The same as in Fig. 4 for 1982-2002. 
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Dow-Jones data, 01/04/1982-12/31/1999 



Dow-Jones data, 01/04/1982-12/31/1999 
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Fig. 6. The same as in Fig. 4 for Dow-Jones. 

are trying to fit the data using a constant positive growth rate \i. Obviously, 
that would cause disagreement. For S&P500 and Dow- Jones, the declines in 
2000s are intermediate and small, so are the deviations from the DY formula. 
We think these deviations are not an argument against the Heston model. 
They rather indicate the change of /i from a positive to a negative value around 
2000. Our conclusion about the change of regime is based on the statistical 
properties of the data for the last 20 years. The situation is very different 
from the crash of 1987. As our plots show, the crash of 1987 did not have 
significant statistical impact on the PDFs of log-returns for 1980s and 1990s, 
because the market quickly recovered and resumed overall growth. Thus, the 
crash of 1987 was just a fluctuation, not a change of regime. To the contrary, 
the decline of 2000s (which is characterized by a gradual downward slide, not 
a dramatic crash on any particular day) represents a fundamental change of 
regime, because the statistical probability distributions have changed. These 
conclusions are potentially important for investment decisions. 

The average growth rate n is an exogenous parameter in the Heston model 
and is taken to be constant only for simplicity. In a more sophisticated model, 
it could be a smooth function of time r, reflecting the long-term trend of 
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Fig. 7. The same as in Fig. 6 for 1982-2002. 
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the market of the scale of 15-20 years. Using a properly selected function /x r , 
one could attempt to analyze the stock-market fluctuations on the scale of a 
century. That would be the subject of a future work. 

We are grateful to Adrian Dragulescu for help and sharing his computer 
codes for data processing and plotting. 
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